Probabilistic modeling of bifurcations in single-cell gene expression data using a Bayesian mixture of factor analyzers
Authors
Abstract
Modeling bifurcations in single-cell transcriptomics data has become an increasingly popular field of research. Several methods have been proposed to infer bifurcation structure from such data, but all rely on heuristic, non-probabilistic inference. Here we propose the first generative, fully probabilistic model for such inference, based on a Bayesian hierarchical mixture of factor analyzers. Our model exhibits competitive performance on large datasets despite implementing full Markov chain Monte Carlo sampling, and its unique hierarchical prior structure enables automatic determination of the genes driving the bifurcation process. We additionally propose an empirical-Bayes-like extension that deals with the high levels of zero-inflation in single-cell RNA-seq data, and we quantify when such models are useful. We apply our model to both real and simulated single-cell gene expression data and compare the results to existing pseudotime methods. Finally, we discuss both the merits and weaknesses of such a unified, probabilistic approach in the context of practical bioinformatics analyses.
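To make the idea of a generative mixture of factor analyzers concrete, the following is a minimal simulation sketch, not the authors' model. It assumes a single latent factor per cell (read as pseudotime), a uniformly random branch label, branch-specific linear gene responses with Gaussian noise, and a constant dropout probability that sets entries to zero. The function name `simulate_bifurcation` and all of its parameters are hypothetical, and the hierarchical priors and MCMC inference mentioned in the abstract are not reproduced.

```python
import numpy as np


def simulate_bifurcation(n_cells=200, n_genes=10, n_branches=2,
                         dropout_rate=0.0, noise_sd=0.5, seed=0):
    """Draw synthetic expression data from a toy mixture of factor analyzers.

    Illustrative assumptions only: one latent factor per cell, uniform
    branch assignment, and a constant dropout probability.
    """
    rng = np.random.default_rng(seed)

    # Latent factor for each cell, interpreted here as pseudotime.
    pseudotime = rng.normal(0.0, 1.0, size=n_cells)

    # Mixture component (branch) of each cell.
    branch = rng.integers(0, n_branches, size=n_cells)

    # Branch-specific intercepts and factor loadings for each gene.
    intercept = rng.normal(0.0, 1.0, size=(n_branches, n_genes))
    loading = rng.normal(0.0, 1.0, size=(n_branches, n_genes))

    # Linear factor-analysis mean plus Gaussian observation noise.
    mean = intercept[branch] + loading[branch] * pseudotime[:, None]
    expression = mean + rng.normal(0.0, noise_sd, size=mean.shape)

    # Crude zero-inflation: each entry is zeroed with fixed probability.
    dropped = rng.random(expression.shape) < dropout_rate
    expression[dropped] = 0.0

    return expression, pseudotime, branch


if __name__ == "__main__":
    y, t, b = simulate_bifurcation(n_cells=100, n_genes=5, dropout_rate=0.3)
    print(y.shape, t.shape, np.bincount(b, minlength=2))
```

Inference would run in the opposite direction, recovering the pseudotime and branch labels from the expression matrix alone; a simulator like this is mainly useful for checking whether a fitted model can recover known structure.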
Similar resources
Probabilistic inference of bifurcations in single-cell data using a hierarchical mixture of factor analysers
Modelling bifurcations in single-cell transcriptomics data has become an increasingly popular field of research. Several methods have been proposed to infer bifurcation structure from such data but all rely on heuristic non-probabilistic inference. Here we propose the first generative, fully probabilistic model for such inference based on a Bayesian hierarchical mixture of factor analysers. Our...
A low-cost variational-Bayes technique for merging mixtures of probabilistic principal component analyzers
Mixtures of probabilistic principal component analyzers (MPPCA) have proven effective for modeling high-dimensional data sets living on nonlinear manifolds. Briefly stated, they conduct mixture model estimation and dimensionality reduction through a single process. This paper makes two contributions: first, we disclose a Bayesian technique for estimating such mixture models. Then, assuming sever...
P-30: The Investigation of Transcript Expression Level of Mitochondrial Transcription Factor A (TFAM) during In Vitro Maturation (IVM) in Single Human Oocytes
Background: In vitro maturation (IVM) of human oocytes has attracted increasing attention in infertility treatment and shows great promise. This technique is an alternative to conventional in vitro fertilization-embryo transfer (IVF-ET) and can reduce the side effects of gonadotropin stimulation, such as ovarian hyperstimulation syndrome (OHSS). Oocyte maturation is a complex process including cytoplasmic and...
Hidden Markov Bayesian Principal Component Analysis
Probabilistic Principal Component Analysis is a reformulation of the common multivariate analysis technique known as Principal Component Analysis. It employs a latent variable model framework similar to factor analysis, which makes it possible to establish a maximum likelihood solution for the parameters that comprise the model. One of the main assumptions of Probabilistic Principal Component Analysis is that ...
Effect of different concentrations of leukemia inhibitory factor on gene expression of vascular endothelial growth factor-A in trophoblast Tumor Cell Line
Background: Several studies have shown that leukemia inhibitory factor (LIF) is one of the most important cytokines participating in the process of embryo implantation and pregnancy, while the role of this factor on vascular endothelial factor-A (VEGF-A), as one of the most important angiogenic factors, has not been fully investigated yet. The aim of this study was to evaluate th...
Journal title:
Volume 2, issue
Pages -
Publication date: 2017